Similarity Search for Multi-dimensional NMR-Spectra of Natural Products

نویسندگان

  • Karina Wolfram
  • Andrea Porzel
  • Alexander Hinneburg
چکیده

Searching and mining nuclear magnetic resonance (NMR)spectra of naturally occurring products is an important task to investigate new potentially useful chemical compounds. We develop a set-based similarity function, which, however, does not sufficiently capture more abstract aspects of similarity. NMR-spectra are like documents, but consists of continuous multi-dimensional points instead of words. Probabilistic semantic indexing (PLSI) is an retrieval method, which learns hidden topics. We develop several mappings from continuous NMR-spectra to discrete text-like data. The new mappings include redundancies into the discrete data, which proofs helpful for the PLSI-model used afterwards. Our experiments show that PLSI, which is designed for text data created by humans, can effectively handle the mapped NMR-data originating from natural products. Additionally, PLSI combined with the new mappings is able to find meaningful ”topics” in the NMR-data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Evaluation of Text Retrieval Methods for Similarity Search of multi-dimensional NMR-Spectra

Searching and mining nuclear magnetic resonance (NMR)spectra of naturally occurring substances is an important task to investigate new potentially useful chemical compounds. Multi-dimensional NMR-spectra are relational objects like documents, but consists of continuous multi-dimensional points called peaks instead of words. We develop several mappings from continuous NMR-spectra to discrete tex...

متن کامل

Median Modified Wiener Filter for nonlinear adaptive spatial denoising of protein NMR multidimensional spectra

Denoising multidimensional NMR-spectra is a fundamental step in NMR protein structure determination. The state-of-the-art method uses wavelet-denoising, which may suffer when applied to non-stationary signals affected by Gaussian-white-noise mixed with strong impulsive artifacts, like those in multi-dimensional NMR-spectra. Regrettably, Wavelet's performance depends on a combinatorial search of...

متن کامل

Effects of Clinacanthus nutans leaf extract on lipopolysaccharide -induced neuroinflammation in rats: A behavioral and 1H NMR-based metabolomics study

Objective: This research revealed the biochemical outcomes of metabolic dysregulation in serum associated with physiological sickness behavior following lipopolysaccharide (LPS)-induced neuroinflammation in rats, and treatment with Clinacanthus nutans (CN). Verification of 1H NMR analysis of the CN aqueous extract proved the existence of bioactive phytochemical constituents’ in extract. Materia...

متن کامل

Choosing the best pulse sequences, acquisition parameters, postacquisition processing strategies, and probes for natural product structure elucidation by NMR spectroscopy.

The relative merits of different pairs of two-dimensional NMR pulse sequences (COSY-90 vs COSY-45, NOESY vs T-ROESY, HSQC vs HMQC, HMBC vs CIGAR, etc.) are compared and recommendations are made for the preferred choice of sequences for natural product structure elucidation. Similar comparisons are made between different selective 1D sequences and the corresponding 2D sequences. Many users of 2D...

متن کامل

HPLC-SPE-NMR: a productivity tool in natural products research

Natural products provide excellent potential leads for drug development because of their chemical diversity and biological functionality. However, the productivity of discovery of new, pharmacologically active natural products has traditionally been low due to inherent difficulties and costs associated with extract dereplication, i.e., isolation, purification and structure elucidation of indivi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006